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Abstract — Interference alignment (lA) is a technique recently 
shown to achieve the maximum degrees of freedom (DoF) of K- 
user interference channel. In this paper, we focus on the precoder 
designs on manifolds for lA. By restricting the optimization only 
at the transmitters' side, it will alleviate the overhead induced by 
alternation between the forward and reverse links significantly. 
Firstly a classical steepest descent (SD) algorithm in multi- 
dimensional complex space is proposed to achieve feasible lA. 
Then we reform the optimization problem on Stiefel manifold, 
and propose a novel SD algorithm based on this manifold 
with lower dimensions. Moreover, aiming at further reducing 
the complexity, Grassmann manifold is introduced to derive 
corresponding algorithm for reaching the perfect lA. Numerical 
simulations show that the proposed algorithms on manifolds have 
better convergence performance and higher system capacity than 
previous methods, also achieve the maximum DoF. 

I. Introduction 

Interference alignment (lA) is a technique recently proposed 
in im, im and |I3], which can achieve much higher wireless 
networks capacity than previously believed |4|. For the K-usei 
M X M multiple-input multiple-output (MIMO) interference 
channel, the sum capacity is 

Csurn = ^ log(l + SNR) + o{log{SNR)) (1) 

with achievability of KM/2 degrees of freedom (DoF), which 
is defined as 

Csum 



DoF 



lim 



(2) 



SNR^oo log{SNR) 

It means each transmitter-receiver pair is able to communicate 
with achievability DoF of A//2, irrespective of the number of 
interferers. 

A feasible way to align interference is to design such a 
precoder that coordinates transmitting directions, in order to 
force the interference overlapping at each receiver, and reserve 
half of the signaling space interference-free for the desired 
signals. Based on the assumption of channel reciprocity, some 
previous works such as Q — ifTOl iteratively optimize both 
the precoder matrices and interference suppression filters, by 
alternating between the forward and reverse links to achieve 
interference alignment in a distributed way. However, with 
the assumption of channel reciprocity, the applicability of 
these algorithms are restricted within TDD systems only. 
Moreover, alternation between the forward and reverse links 
needs tight synchronization and feedback at each node, which 



may introduce too much overhead when the channel varies 
quickly. Furthermore, the transmitters and receivers exchange 
their "roles" during each iteration of optimization. Thus this 
scheme is inappropriate for the receivers with limited comput- 
ing ability. 

On the other hand, most of the previous works above 
employ traditional constrained optimization techniques that 
work in high dimensional complex space. Unavoidably, several 
shortcomings are accompanied with the traditional constrained 
optimization techniques such as low-converging speed and 
high-complexity. 

To overcome these limitations, in this paper, we introduce 
optimization on matrix manifolds into the precoder design 
for interference alignment, and limit the optimization only at 
the transmitters' side. Optimization algorithms on manifolds 
consist the merits of lower complexity and better numerical 
properties. Firstly for the sake of comparison, by employing 
classical constrained optimization method, a steepest descent 
(SD) algorithm in multi-dimensional complex space is pro- 
vided to design the precoder of interference alignment. Then 
we reformulate the constrained optimization problem to an 
unconstrained and non-degraded one on the complex Stiefel 
manifold with lower dimensions. We locally parameterize the 
manifold by Euclidean projection from the tangent space onto 
the manifold instead of the traditional method by moving 
descent step along the geodesic in [111 and fTS]. Thus the SD 
algorithm on Stiefel manifold is proposed to achieve feasible 
interference alignment. To further reduce the computation 
complexity in terms of dimensions of manifold, we explore 
the unitary invariance property of our cost function, and solve 
the optimization problem on the complex Grassmann mani- 
fold, then present corresponding SD algorithm on Grassmann 
manifold for interference alignment. 

We not only generalize optimization algorithm on mani- 
folds, but also turn the algorithm into an efficient numerical 
procedure to achieve perfect interference alignment. Moreover 
by limiting the optimization algorithms performed at the 
transmitters' side only, all the three proposed algorithms are 
transparent at the receivers. Additionally, overhead generated 
by synchronization and feedback no longer exits since only 
transmitters participate in the iteration. Besides, by relaxing 
the assumption of channel reciprocity, our algorithms are 
applicable to both TDD and FDD systems. Furthermore, 
numerical simulation shows that the novel algorithms on 



manifolds have better convergence performance and higher 
system capacity than previous methods. Finally, we prove the 
convergence of the proposed algorithms. 

The paper is organized as following. Interference channel 
is briefly summarized in Section II and the mathematical 
model of interference alignment is presented in Section III, 
followed by the detailed procedures of all three proposed 
SD algorithms for interference alignment in Section IV. In 
section V numerical simulations and corresponding discussion 
are stated. And the conclusion is given in the last section. 

Notation: We use bold uppercase letters for matrices or 
vectors. and denote the transpose and the conjugate 
transpose (Hermitian) of the matrix X respectively. Assum- 
ing the eigenvalues of a matrix X and their corresponding 
eigenvectors are sorted in ascending order, denotes the i*-^ 
eigenvalue of the matrix X. Then / represents the identity 
matrix. Moreover tr{-) indicates the trace operation. And the 

Euclidean norm of X is ||X|| ^tr{X^X). [X\ denotes 
the subspace spanned by the columns of X. C"^^ represents 
the n X p dimensional complex space assuming n > p. 
M+ represents positive real number space. and ^l-} 

denote the real and imaginary parts of a complex quantity, 
respectively. Finally k — {!,..., K} is the set of integers from 
1 to K. 

II. System model 

Consider the K-user wireless MIMO interference channel 
depicted in Fig. [T] where each transmitter and receiver are 
equipped with A/I*^! and N^''^ antennas respectively. Each 
transmitter communicates with its corresponding receiver, and 
creates interference to all the other receivers, d^'^l is the desired 
number of data streams between the k*^ transmitter-receiver 
pair Additionally, ffl'^^l denotes the A^l'^l xilfl^l channel coef- 
ficients matrix from the j*'' transmitter to the k*'' receiver, and 
is assumed to have i.i.d. complex Gaussian random variables, 
drawn from a continuous distribution. Finally the received 
signal vector at receiver k after zero-forcing the interference 
is denoted by 
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Fig. 1. K-user MIMO interference cliannel 



receiver Therefore interference alignment can be achieved by 
progressively reducing the power of leakage interference. And 
if interference alignment is feasible, the leakage interference 
can eventually be coordinated to zero. From I5j, it can be 
obtained that the d!'''! -dimensional received signal subspace 
that contains the least interference is the space spanned by 
the eigenvectors corresponding to the ojl'^l -smallest eigenvalues 
of the interference covariance matrix Q''''. Consequently, we 
try to minimize the sum of interference power spilled to 
the desired signal subspaces, by minimizing the sum of the 
absolute value of dl*^' -smallest eigenvalues of the interference 
covariance matrix at each receiver to create d^*^! -dimensional 
interference-free subspace for the desired signal. 



B. Cost Function 

As previously stated, we try to minimize the sum of the 
rf!'''! -smallest eigenvalues (in absolute value) of the interference 
covariance matrix at each receiver, over the set of precoder 
matrices V^^\...,V^^^ H^. Therefore, we define the cost 
function as: 



,k e 



(3) 

where each element of the d^^l x 1 vector S''-'' represents an 
independently encoded Gaussian symbol with power pl^l /rfl^l 
that beamformed with the corresponding x rfl^l precoder 
matrix V^-'\ and then transmitted by the transmitter j. t/'*^' is 
the X (/[''l interference zero-forcing filter at the receiver 
k. And VF'*^' is the i.i.d. complex Gaussian noise with zero 
mean unit variance. 

III. Mathematical Model 

A. Feasibility of Interference Alignment 

The quality of alignment is measured by the interference 
power remaining in the intended signal subspace at each 
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(4) 



(5) 



is the interference covariance matrix at receiver k. With the 
assumption that all the eigenvalues are sorted in ascending 
order, Al,[fc] represents the i*'' eigenvalue of the corresponding 



interference covariance matrix Q''^'. And because Q'*^' is a 
Hermitian matrix, all its eigenvalues are real. Therefore, the 
cost function /(V), / : C"^p ^ M+ is built. 



IV. Algorithms on Different Topologies for 
Interference Alignment 

A. The Steepest Descent Algorithm in Complex Space for lA 

Since our cost function: f{V), f : C"^^ ^ R+ is 
differentiable, intuitively the steepest descent method can be 
employed to make the cost function converge to a local optimal 
point efficiently. As well known, the SD method typically can 
be iteratively processed as: by adding an update increment to 
the previous iterate in order to reduce the cost function; the 
update direction and step size are generally computed using 
a local model of the cost function, typically based on first 
derivatives of the cost function. Therefore we will first find 
the closed-form expression of the steepest descent direction in 
C"^'', then employ a property step size rule for each iteration. 

As previously stated, the steepest descent method is tightly 
related to derivative and differentiation. In order to get the 
derivative of f{V) over V, two Jacobian matrices blocks are 
employed as: 
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where V 



= K{V-[^1}, and V^p = D^' and 

il>j' are the d'-'l x M^-'l Jacobian matrices which denote 
the partial differential relation of the cost function over the 
real and imaginary parts of v'^' respectively. The detail of 
mathematical derivations can be found in ifTsl and lfT4l . Thus, 
the derivative of / over V'^' is given by 



.b1 
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(6) 
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(7) 



The inner product typically defined in the Euclidean multi- 
dimensional space is given as: 



(8) 



Then, under the given inner product, the steepest descent 
direction is: 



D 



b1 
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(9) 



Once the formulation of steepest descent direction Z'-*' is 
defined in (|9|l, it is necessary to choose a suitable positive 
step size /jl^l for each iteration. The Armijo step size rule 
IITSl states that /Sl-'l should be chosen to satisfy the following 
inequalities: 

fiV) - fiV + /3ZM) > (Zl^l, ZM) (10) 
/(F) -/(V + 2/3Z[^'l) < ^[^1 {Jz[^l,Z[^lJ) (11) 

Rule ( [Tol l guarantees that the step /jl^lz'-'' will expressively 
decrease the cost function, whereas ( fTTT i undertakes that the 
step 2/3[^l Z'-*' would not be a better choice. A direct procedure 
for acquiring a suitable /jl^l is to keep on doubling /jl^l until 



(fTTT i no longer holds and then halving /jt-'l until it satisfies ( fTOl i. 
It can be proved that such /jl-'l can always be found |16|. 

Consolidating all the ideas stated above, we present our 
algorithm in Algorithm [T] In Step 3 and Step 4, the Armijo 
step rule is performed to find a proper convergence step length. 
Generally speaking. Step 3 ensures the chosen step /jl^l will 
significantly reduce the cost function while Step 4 prevents /S'^l 
from being too large that may miss the potential optimal point. 
The operator gs{-) means Gram-Schmidt Orthogonalization 
ifTTl of a matrix, which guarantees the newly computed solu- 
tion V'-'' (or B^^\ B^^ ) still satisfies the unitary constraint. 



Algorithm 1 The Steepest Descent Algorithm in Complex 
Space for I A 

Start with arbitrary precoder matrices V'^' , V'^', set initial 
step size /j'-*' = 1 and begin iteration, 
for j = 1...K 

(1) Compute the Jacobian matrix D^' and D^p 

(2) Get the steepest descent direction 

(3) Compute = gs(VW + 2^Wz[^l), 

if /(vw, v^) - /(vw, y[^-ii, b'^'i, 

> /3Wfr(Z[-'ltz[-'l), then set Z?!^' := 2/31^1, repeat Step 3. 

(4) Compute sff' = gs(VW + /J^Z^), 

if/(yw,...,y[^^'i)-./(v""',-,v'^-ii,B!i'i,v[^+ii...v[^i) 

< i/3[-'lfr(Z[-'ltzW), then set /jl^' := i/jW, repeat Step 4. 
(6) Continue till the cost function / is sufficiently small. 



Discussion: 

• (i) The inner product and the gradient direction are 
defined in different topologies in 113|. However it is 
considered to be inappropriate because the gradient is 
defined after the inner product is given only. In other 
words, the inner product and the gradient direction must 
be defined in the same topology. Our proposed algorithm 
rectifies the topology flaw in |[T3l , thus avoids the risk of 
non-convergence. 

• (ii) It can be concluded that Algorithm 1 belongs to the 
classical optimization method, which means it woks in 
multi-dimensional space C"^^ with the dimensions: 



dim(C"''P) = np 



(12) 



Obviously the algorithm complexity increases with the 
dimensions. As discussed before, optimization algorithms 
on manifolds work in an embedded or quotient space 
whose dimension will be much smaller than that of 
classical constrained optimization methods. Thus, opti- 
mization algorithms on manifolds not only have lower 



complexity, but also perform better numerical properties. 
The corresponding algorithms on manifolds will be stated 
for lA in the next two subsections. 

B. The Steepest Descent Algorithm on Complex Stiefel Mani- 
fold for lA 

Informally, a manifold is a space that is "modeled on" 
Euclidean space. It can be defined as a subset of Euclidean 
space which is locally the graph of a smooth function. 

Conceptually, the simplest approach to optimize a differen- 
tiable function is to continuously translate a test point in the 
direction of steepest descent on the constraint set until one 
reaches a point where the gradient is equal to zero. However, 
there are two challenges for optimization on manifolds. First, 
in order to define algorithms on manifolds, these operations 
above must be translated into the language of differential 
geometry. Second, once the test point shifts along the steepest 
descent direction, it must be retracted back to the manifold. 
Therefore, after reformulating the optimization problem on the 
Stiefel manifold, we introduce definitions about project opera- 
tion and tangent space for retraction and gradient respectively. 

In many cases, the underlying symmetry property can be 
exploited to reformulate the problem as a non-degenerate 
optimization problem on manifolds associated with the orig- 
inal matrix representation. Thus the constraint condition 
■^[iltij/b] — / in the cost function (|4|i, inspires us to solve 
the problem on the complex Stiefel manifold. The complex 
Stiefel manifold iflTl St{n,p) is the set satisfying 

St{n,p) ^ {X eC'^'P : X^'X = 1} (13) 

St{n,p) naturally embeds in C"^^ and inherits the usual 
topology of C"^^. It is a compact manifold and from Propo- 
sition 3.3.3 in ifTSl . we can get: 

dim{St{n,p)) = np-^p{p+l) (14) 

Another important definition is the projection. Assuming 
Y S C"^P is a rank-p matrix, the projection operator Trst{-) ■ 
£nxp _^ St{n,p) is given by 

7r,t(y)-arg min (15) 

XeSt(n,p) 

It can be proved that there exits a unique solution if Y has 
full column rank [IS]. From ( fTSl l. it can be acquired that the 
projection of an arbitrary rank-p matrix Y onto the Stiefel 
manifold is defined to be the point on the Stiefel manifold 
closest to Y in the Euclidean norm |19|. Besides, if the 
singular value decomposition (SVD) of Y Y = V\ 
then 

TTst{Y)=UI.„^j,V^ (16) 

based on the Proposition 7 in |fT9l . 

Consider X G St{n,p) and its disturbing point TTst{X + 
sY) G St{n,p) for certain directions matrix Y G C"xp and 
scalai-e G R . If F satisfies /{-KstiX +eY)) = f{X) + 0{e^) 
which means certain directions Y do not cause iTst {X + sY) 
to move away from X as e increases. The collection of such 



directions Y is called the normal space at X of St{n,p) ifTsl . 
The tangent space Txin^p) is defined to be the orthogonal 
complement of the normal space, which can be roughly 
illustrated as Fig. |2] And the mathematical expression of the 
tangent space Tx{n,p) at X G St{n,p) is defined by 

Tx{n,p) = {Z e C'^^'P : Z = XA + X^B,Ae C^^^p, 

A + = 0,B e C("-P)^P} 

(17) 

in which X± G C"^^"^''-' is defined to be any matrix 
satisfying [X X^]^ [X X±] = I and is the complement of 
X G St{n,p). Also from |T9l, it can be obtained that the 
gradient of our cost function is in the tangent space Tx{n,,p). 
And the dimension of Tx{n,p) is: 

diin{Tx{n,p))^p{2n~p) (18) 

Obviously the steepest descent algorithm requires the com- 




Fig. 2. Tangent space of Stiefel manifold 

putation of the gradient. As we previously emphasize, the 
gradient is only defined after Tx{n,p) is given an inner 
product: 

(Zi,Z2) = 3?{ir(Z^(7-ixX+)Zi)} (19) 

where Zi,Z2 G Tx{n,p) and X G St{n,p). The derivation 
of ( fT9l) can be found in 1 12|. Therefore, under the defined inner 
product, the steepest descent direction of the cost function 
f{X) at the point X G St{n,p) is 

Z = XDI^X - Dx (20) 

where Dx is the derivative of f{X). The proof of ( 12^ is in 

Inspired by fl9\ the proposed SD algorithm on complex 
Stiefel manifold is presented in Algorithm |2] From (fT9] l and 
(|20] |. it can be easily obtained that the inner product needed 
for the Armijo step rule is 

= K{tr(Z[-''l^(J- ivI^VI^l V'')} (21) 

which is used in Step 4 and 5; and the steepest descent on 
Stiefel manifold of our cost function is 

^bl =yb]_DMtyb] _£,M (22) 



which is used in Step 3. Noticing that the project operation 
7rs((-) in Step 6 (Step 4, 5) guarantees the newly computed 
solution V'"'' (or B^^\ bI^^ ) after iteration still satisfies 
V^-'^ £ St(n,p). Using the method of SVD, we can easily 
compute the project operation. 



Algorithm 2 The Steepest Descent Algorithm on Complex 
Stiefel Manifold for lA 

Start with arbitrary precoder matrices V'^', V'^', set initial 

step size /j'-'' = 1 and begin iteration. 

for j = I...K 

(1) Compute the Jacobian matrix D^^ and D^f 

(2) Then get the derivative of /: 

(3) Get the steepest descent direction 

(4) Compute B^^' = -KstiV^^^ + 213^^^ Z^^^), 

if/(vw,...,v^)-,f(yw,...,yi^-^',B^^',v[^+i'...y[^i) > 

/3[-''K{tr(Z[-'lt(7 - \V^'W^'^'^)Z^'^)}, then set ^'^1 — 2^'^1, 
and repeat Step 4. 

(5) Compute Bf = 7r,t(y[^l + /j'^lz'^l), 

if f{y^^\ ...,V^''^)-f(V^'\ ...M'-^\Bf ,V^^+^\..V^''^) < 
i/3l^lK{fr(Zl^lt(/_ivMv-[j]t)^[.7])}^then set /J^ — 
and repeat Step 5. 

(7) Continue till the cost function / is sufficiently small. 



Discussion: 

• (i) As previous stated, the algorithms in flT] and fTS] 
are performed by moving the descent step along the 
geodesic of the constrained surface within each iteration. 
A disadvantage of this method is the redundant compu- 
tational cost for calculating the path of a geodesic |19|. 
In this paper, we locally parameterize the manifold by 
Euclidean projection from the tangent space onto the 
manifold instead of moving along a geodesic, to achieve 
a modest reduction in the computational complexity of 
the algorithms. 

• (ii) Recall ( fT2b and ( fTSl l, it can be obtained that when 
we reformulate the problem from C"^^ to St{n,p), the 
dimension of the optimization problem decreases from np 
to np — ^p{p + 1). Although such dimension-dissension 
can be observed clearly, we still intend to reduce the 
dimensions of the space which the optimization algorithm 
works in. Thus the Grassmann manifold and its corre- 
sponding algorithm for lA are stated in the following 
subsection. 



C. The Steepest Descent Algorithm on Complex Grassmann 
Manifold for lA 

Notice that our cost function f{V) satisfies f{VU) = 
f{V) for any unitary matrix U. Because 

i=i 

^ P^jj[kj]y[j]jy[mjj[km 

= g['=l(V) (23) 

which means that multiplying unitary matrix U does not 
change the eigenvalues and their corresponding eigenvectors 
of the interference covariance matrix at each receiver. Thus 
our cost function / should be minimized on the Grassmann 
manifold rather than on the Stiefel manifold. This is because 
the Grassmann manifold treats V and VU as equivalent 
points, leading to a further reduction in the dimension of the 
optimization problem. Similar with the previous section, we 
firstly introduce the definition about Grassmann manifold, then 
present the project operation and tangent space of Grassmann 
manifold for retraction and gradient respectively. 

The complex Grassmann manifold Gr{n,p) is defined to 
be the set of all p-dimensional complex subspaces of C"^^. 
Grassmann manifold can be thought as a quotient space of 
the Stiefel manifold: Gr{n,p) ~ St{n,p)/ St{p,p). Quotient 
space is more difficult to visualize, as it is not defined as 
set of matrices; rather, each point of the quotient space is an 
equivalence class of n x p matrices. 

However, we can understand quotient space in this way: 
assuming X € St{n,p) is a point on the Stiefel manifold, the 
columns of X span an orthonormal basis for a p-dimensional 
quotient subspace. That is to say, if [X\ denotes the subspace 
spanned by the columns of X, then X e St{n,p) implies 
[XJ e Gr{n,p). Therefore, there is a one-to-one mapping 
between points on the Grassmann manifold Gr{n,p) and 
equivalence classes of St{n,p). From (fT4l i. it can be acquired 
that: 

dim{Gr{n,p)) = dim(S't(7i,p)) — diTn{St{p,p)) 

^p{n~p) (24) 

Let 1^ G C"^P be a rank-p matrix. The projection operator 
TTgr{-) : C"^P — > Gr{n,p) onto the Grassmann manifold is 
defined to be 



iY) = 



arg mm 

XeSt(n,p) 



\Y-X\ 



(25) 



It also can be proved that there exits a unique solution if 
Y has full column rank ITSll . From ( |25] ). it can be acquired 
that the projection of an arbitrary rank-p matrix Y onto the 
Grassmannn manifold is defined to be the subspace spanned by 
the point on the Stiefel manifold closest to Y in the Euclidean 
norm. Besides, if the QR decomposition of Y is Y = QR, 



the following equality holds: 



(26) 



The proof of ( |26] | also can be found in |19|. From ( |26] ), it is 
obvious that Trgr{Y) is the subspace spanned by the first p 
columns of Q. 

As discussed before, Grassmann manifold is a quotient 
space of the Stiefel manifold, thus its tangent space is a 
subspace of the Stiefel manifold's tangent space |18|. If X G 
St{n,p) , the tangent space T^^xii^yp) at [XJ £ Gr{n,p) of 
Grassmann manifold is: 

Tixi {n,p) = {Z e C"^P : Z = X^B, B £ C^^-p^^p} 

(27) 

Recall (fTsT l. the dimension of the tangent space T^x\ {^tP) of 
complex Grassmann manifold is: 

dim [Tyx] {n,p)) = d\va{Tx{n,p)) - dim(Tx(p,p)) 

^ p{2n - 2p) (28) 

Besides the inner product of T^x} ("iP) is given by: 

(Zi,Z2) =K{tr(Z2^Zi)}, Zi,Z2 STlxjKp), 

XeSt{n,p) (29) 



The derivation of ( |29] l can be found in |12|. Therefore, under 
the defined inner product, the steepest descent direction fiT9l 
of the cost function f{X) at the point X G Gr{n,p) is: 

Z = -{I - XX'^)Dx 



(30) 



where Dx is the derivative of f(X). 



Algorithm 3 The Steepest Descent Algorithm on Complex 
Grassmann Manifold for lA 

Start with arbitrary precoder matrices V'^', V'^', set initial 

step size /j'^' = 1 and begin iteration. 

for j = 1...K 

(1) Compute the Jacobian matrix D^^ and D^f 

(2) Then get the derivative of /: 

(3) Get the steepest descent direction 

(4) Compute B^f = TTgr{V^^^ + 2^Wzl^l), 

if/(vw,...,v^)-/(vw,..., > 

pUhriZ^^^" Z^^^), then set fi^^ ■- 2I3^\ and repeat Step 4. 

(5) Compute slf' = TTgr{V^^^ + /J^ZM), 

if/(yw,...,y[^i)-/(^'",-,v[^-ii,Bli'i,y[^+ii...vM) < 

i/3Wtr(ZW^zW), then set Z?'-'' := f^W, and repeat Step 5. 
(7) Continue till the cost function / is sufficiently small. 



ifold is presented in Algorithm |3] Similar with the previous 
proposed algorithms, the Armijo step rule is performed to find 
a proper convergence step length. From (|29] l and dSOl l. it can be 
easily concluded that the inner product needed for the Armijo 
step rule is 

'Z[^'1,Z[J1) =tr(Z[^ltz[^'l) (31) 



which is used in Step 4 and 5; and the steepest descent on 
Grassmann manifold of our cost function is 



U] 



(32) 



which is used in Step 3. And the project operation TTgr{-) 
in Step 6 (Step 4, 5) retracts the newly computed solution 
yl^l (or B^^\ B^-'l) back onto Gr assmann manifold Gr{n,p). 
Using QR decomposition, we can easily compute the project 
operation. 
Discussion: 

m Comparing dim(G'r(n,p)) — p{n — p) in (|24] | with 
d\T[v{St{n,p)) — np — \p{p + 1) in (fT4l i. a further 
dimension reduction can be observed. Similarly, from 
( |28] l we can see another advantage of using the Grass- 
mann manifold rather than the Stiefel manifold is that 
Tyx\{n,p) has only p{2n — 2p)dimensions, whereas 
tangent space of St{n,p) has p{2n — |5)dimensions. And 
from II20I . it can be obtained that in our system model, if 
each transceiver is equipped with same amount of antenna 
(M = N), then 



and 



k=l 



K ■ M 



(33) 



(34) 



Recall ([T2]i, (O and (|24|, we can get that if M is 
large enough ( M not only can represent the number of 
antennas each transceiver equipped, but also can refer to 
the number of time extension slots |T| ll20l ). hence 

dim{St{M,d)) _ 3 
4 



(35) 



dim(C 

which is a clear evidence for dimension-descension. And 

dim{Gr{M,d)) _ 1 
^ 2 



(36) 



dim(C A^^^'') 

holds for any integer M. (l36T l means that optimization 
on Grassmann manifold would reduce dimension further 
The trend of dimension-descension can be roughly illus- 
trated in Fig. [3] 

V. Numerical Results and Discussion 

Without symbol extension, the feasible condition of k-user 
interference alignment ||20| is given by: 



(37) 



Mb] + nU] >{K + 



The proposed SD algorithm on complex Grassmann man- 



For satisfying feasibility and simple computation, we consider 
a 3 -user 2x2 MIMO interference channel where the desired 



Fig. 3. Trend of dimension-descension 



DoF per user rfl^l is 1 . All the algorithms are executed under 
the same scenario including randomly generated channel coef- 
ficients, initial precoder matrices and convergence step length. 
We simulated the proposed three SD algorithms through 100 
simulation realizations. 

As shown in Fig.|4] Fig. |5] and Fig.|6l each curve represents 
an individual simulation realization and all results converge 
after 20 or more iterations, which is a clear indication of the 
algorithm convergence performance in such limited iterations. 
In order to compare the convergence performance, the average 
values of 100 realizations results are illustrated in Fig. |7] 
It can be observed that the algorithms on manifolds have 
better convergence performance comparing with the classical 
optimization method as our expectation. 

This is attributed to the reason that we reformulate the 
constrained optimization problem to an unconstrained one on 
manifolds with lower complexity and better numerical prop- 
erties; then locally parameterize the manifolds by a Euclidean 
projection of the tangent space on to the manifolds instead of 
moving along the geodesic, as stated in the previous sections. 
Moreover the convergence performance curve of SD algorithm 
on Stiefel manifold and the curve of SD algorithm on Grass- 
mann manifold almost overlap. Recall that optimization on 
Grassmann manifold would reduce dimension further, there- 
fore the SD algorithm on Grassmann manifold will guarantee 
performance and reduce the computation complexity at the 
same time. 

Meanwhile, since there are two interference signals at each 
receiver, as shown in Fig. [8] the angles between the spaces 
spanned by each interference signals asymptotically converge 
to zero within one simulation realization, which is another 
evidence for achieving the perfect interference alignment. 

Finally we compare the system sum-rate of the proposed 
algorithms. Fig. |9] shows that the SD algorithm on Stiefel 
manifold and the SD algorithm on Grassmann manifold almost 
have the same performance, and outperform the other classical 
optimization algorithms. More importantly, at high SNR the 
DoF of the three proposed algorithms nearly achieve 3, which 
is the maximum theoretical value (KM/2 — 3) . Therefore 
the perfect interference alignment is successfully achieved. 

Two reasons leading to the fact that the algorithms on 
manifolds obtain higher system capacity are presented below: 
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Fig. 4. Normalized remnant interference of the SD algorithm in complex 
space 
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Fig. 5. Normalized remnant interference of the SD algorithm on Stiefel 
manifold 
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Fig. 6. Normalized remnant interference of tlie SD algoritlim on Grassmann 
manifold 



(i) It is noticed that our cost function actually is the 
interference power spilled from the interference space 
to the desired signal space. With the better convergence 
performance, the SD algorithms on manifolds will have 
less remnant interference in the desired signal space 
within same iteration times. Therefore, the SD algorithms 
on manifolds will get higher SINR ||5|: 

signal power 



SINR:^ 



(38) 



noise + remnant interference 

which leads to high capacity. 
• (ii) At each receiver, the zero forcing filter is adopted. 
It will project the desired signal power and the remnant 
interference onto the subspace which is orthogonal with 
the subspace spanned by the interference. After perform- 
ing the SD algorithms on manifolds, it is observed that 
in the Euclidean norm distance, the subspace spanned 
by desired signal is more close to the orthogonal com- 
plement of the interference subspace. Therefore, even 
the proposed algorithms on manifolds finally get the 
same remnant interference as the classical optimization 
methods results. The algorithms on manifolds will suffer 
from less power lose during the projection operated by 
zero forcing filter, hence higher system capacity can be 
achieved. 

We notice that better throughputs may be attained by using 
non-unitary precoders, or by applying power water-filling in 
the equivalent non-interfering MIMO channels. Nevertheless, 
these methods to increase throughputs can be performed as the 
second step after the interference alignment is achieved [11|. 
Thus in this paper, we only need to concentrate on the first 
step to find the perfect solutions of interference alignment. 

VI. Conclusion and Future Work 

In this paper, we focus on the precoder designs on manifolds 
for interference alignment. By restricting the optimization only 
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Fig. 7. Convergence performance 



at the transmitters' side, it will alleviate the significant over- 
head induced by alternation between the forward and reverse 
links. A classical SD algorithm in multi-dimensional complex 
space is proposed first. Then we reform the optimization prob- 
lem on Stiefel manifold, and propose a novel SD algorithm 
on this manifold with lower dimensions. Moreover, aiming 
at further reducing the complexity, Grassmann manifold is 
introduced to derive corresponding algorithm for reaching the 
perfect interference alignment. Numerical simulations show 
that comparing with previous methods, the proposed algo- 
rithms on manifolds have better convergence performance and 
higher system capacity, also achieve the maximum DoF. 

The proof for convergence of the proposed algorithms is 
quite simple. Our cost function is non-negative with the low 
bound zero. It monotonically decreases within each iteration. 
Therefore it must converge to a solution which is very close 
to zero. However, there is no guarantee that our cost function 
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is convex 13]. Thus finding a global optimum is in our future 
work. 

We also do some simulations by using more sophisticated 
algorithm, such as Newton-type method, to achieve quadratic 
convergence. Yet, except for the increased computational 
complexity, the Newton method will converge to the closet 
critical point |fT9ll . Therefore the Newton method coupled 
with the steepest descent algorithm (a few iterations of the 
SD algorithm are performed first to move close to a local 
minimum before the Newton algorithm is applied) will be 
investigated in our future work too. 

Acknowledgment 

This work was supported by National Natural Science 
Foundation of China (No. 611711 12) and MIIT of China (No. 
2010ZX03005-001-02). 

References 

[1] V. R. Cadambe and S. A. Jafar, "Interference alignment and degrees 
of freedom of the K-user interference channel,"IEEE Trans. Information 
Theory, vol. 54, no.8, pp. 3425-3441,Aug 2008. 

[2] M. Maddah-Ali, A. Motahari, and A. Khandani, "Signaling over 
MIMO multi-base systems: Combination of multi-access and broadcast 
schemes,"/nf. Symp. Inform Theory 2006, pp. 2104-2108. 

[3] M. A. Maddah-Ali, A. S. Motahari, A. K. Khandani, "Communication 
over MIMO X channels: interference alignment, decomposition, and 
performance analysis," IEEE Transactions on Information Theory, vol.54, 
no.8, pp.3457-3470, Aug. 2008. 

[4] Cadambe, Jafar, "Reflections on interference alignment and the degrees of 
freedom of the K-user MIMO interference channel,"IEEE Inform Theory 
Society Newsletter, vol. 54, no.4, pp. 5-8, Dec 2009. 

[5] K. S. Gomadam, V. R. Cadambe, and S. A. Jafar, "Approaching the capac- 
ity of wireless networks through distributed interference aWgnmenir IEEE 
GLOBECOM, New Orleans, Louisiana, Nov 2008, pp. 1-6. 

[6] S. W. Peters, R. W. Heath, "Interference alignment via alternating mini- 
mization, " IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, 
Taipei, Taiwan, April 2009, pp. 2445-2448. 

[7] J. Raj Kumar and F. Xue, "An iterative algorithm for joint signal and 
interference alignment, "IEEE ISIT 2010, Austin, pp. 2293 - 2297. 

[8] Sheng Liu and Yinggang Du and Meng Zhao, "A new joint iterative 
transceiver design with interference alignment method, "Wireless and 
Optical Communications Conference, 19th Annual, Shanghai, 2010, pp. 
1-5. 



[9] Hui Shen and Bin Li, "A novel iterative interference alignment scheme 

via convex optimization for the MIMO interference channel "Vehicular 

Technology Conference, Ottawa, Set. 2010. pp. 1-5. 
[10] Hui Shen and Bin Li and Meixia Tao and Yi Luo, "The new interference 

alignment scheme for the MIMO interference channel "Wireless Commu- 
nications and Networking Conference, Sydney, Apr. 2010. pp. 1-6. 
[11] I. Santamaria, O. Gonzalez, R. W. Heath, and S. W. Peters, "Maximum 

sum-rate interference alignment algorithms for MIMO channels ,"/££'£ 

GLOBECOM 2010 , pp. 1-6, Dec. 2010. 
[12] A. Edelman, T. A. Arias, and S. T. Smith, "The geometry of algorithms 

with orthonormality constraints," SIAM J. Matrix Anal. Applic. , vol. 20, 

no. 2, pp.303 - 353 , 1998. 
[13] Hadi G. Ghauch, Constantinos B. Papadias, "Interference alignment: 

A one-sided approach, "IEEE Global Telecommunications Conference, 

Texas, Dec. 2011, pp. 1-5. 
[14] A. Hjorungnes, Complex-Valued Matrix Derivatives: With Applications 

in Signal Processing and Communications. Cambridge, UK: Cambridge 

University Press, 2011. 
[15] S. Boyd and L. Vandenberghe, Convex Optimization. Cambridge, UK: 

Cambridge University Press, 2004. 
[16] E. Polak, Optimization: Algorithms and Consistent Approximations, 

New York: Springer- Verlag, 1997. 
[17] Xianda Zhang, Matrix Analysis and Applications. Beijing, China: 

Springer, 2004. 

[18] P. A. Absil, R. Mahony and R. Sepulchre, Optimization Algorithms on 
Matrix Manifolds, New Jersey: Princeton University Press, 2008. 

[19] J. H. Manton, "Optimization algorithms exploiting unitary constraints," 
IEEE Trans. Signal Process. 50 (2002), no. 3, 635-650. 

[20] Syed A. Jafar, "Interference Alignment: A New Look at Signal Di- 
mensions in a Communication Network," Foundations and Trends in 
Communications and Information Theory, Vol. 7, No. 1, pages: 1-136, 
2011. 



60 I 1 1 1 1 1 r 




5 10 15 20 25 30 35 40 45 50 

Power per transmiter(dB) 



